OcrV1, Main, Exploration, bibRecord, 001612

Text detection and recognition in images and video frames

Identifieur interne : 001612 ( Main/Exploration ); précédent : 001611; suivant : 001613

Text detection and recognition in images and video frames

Auteurs : D. Chen [Suisse] ; J. M. Odobez ; H. Bourlard

Source :

Pattern Recognition [ 0031-3203 ] ; 2004.

RBID : Pascal:04-0085114

Descripteurs français

Pascal (Inist)
- Théorie, Analyse image, Codage image, Traitement texte, Algorithme, Extraction caractéristique, Expérience.

English descriptors

KwdEn :
- Algorithms, Broadcast documents, Experiments, Feature extraction, Image analysis, Image coding, Machine learning text verification, Text processing, Theory.

Abstract

This paper presents a new method for detecting and recognizing text in complex images and video frames. Text detection is performed in a two-step approach that combines the speed of a text localization step, enabling text size normalization, with the strength of a machine learning text verification step applied on background independent features. Text recognition, applied on the detected text lines, is addressed by a text segmentation step followed by an traditional OCR algorithm within a multi-hypotheses framework relying on multiple segments, language modeling and OCR statistics. Experiments conducted on large databases of real broadcast documents demonstrate the validity of our approach. © 2003 Pattern Recognition Society. Published by Elsevier Ltd. All rights reserved.

Affiliations:

Suisse

Links toward previous steps (curation, corpus...)

to stream PascalFrancis, to step Corpus: 000570
to stream PascalFrancis, to step Curation: 000220
to stream PascalFrancis, to step Checkpoint: 000455
to stream Main, to step Merge: 001666
to stream Main, to step Curation: 001612

Le document en format XML

<record><TEI><teiHeader><fileDesc><titleStmt><title xml:lang="en" level="a">Text detection and recognition in images and video frames</title>
<author><name sortKey="Chen, D" sort="Chen, D" uniqKey="Chen D" first="D." last="Chen">D. Chen</name>
<affiliation wicri:level="1"><inist:fA14 i1="01"><s1>Dalle molle Inst.Perceptual Artif.</s1>
<s2>Martigny, CH 1920</s2>
<s3>CHE</s3>
<sZ>1 aut.</sZ>
</inist:fA14>
<country>Suisse</country>
<wicri:noRegion>Dalle molle Inst.Perceptual Artif.</wicri:noRegion>
</affiliation>
</author>
<author><name sortKey="Odobez, J M" sort="Odobez, J M" uniqKey="Odobez J" first="J. M." last="Odobez">J. M. Odobez</name>
</author>
<author><name sortKey="Bourlard, H" sort="Bourlard, H" uniqKey="Bourlard H" first="H." last="Bourlard">H. Bourlard</name>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">INIST</idno>
<idno type="inist">04-0085114</idno>
<date when="2004">2004</date>
<idno type="stanalyst">PASCAL 04-0085114 EI</idno>
<idno type="RBID">Pascal:04-0085114</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000570</idno>
<idno type="wicri:Area/PascalFrancis/Curation">000220</idno>
<idno type="wicri:Area/PascalFrancis/Checkpoint">000455</idno>
<idno type="wicri:doubleKey">0031-3203:2004:Chen D:text:detection:and</idno>
<idno type="wicri:Area/Main/Merge">001666</idno>
<idno type="wicri:Area/Main/Curation">001612</idno>
<idno type="wicri:Area/Main/Exploration">001612</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title xml:lang="en" level="a">Text detection and recognition in images and video frames</title>
<author><name sortKey="Chen, D" sort="Chen, D" uniqKey="Chen D" first="D." last="Chen">D. Chen</name>
<affiliation wicri:level="1"><inist:fA14 i1="01"><s1>Dalle molle Inst.Perceptual Artif.</s1>
<s2>Martigny, CH 1920</s2>
<s3>CHE</s3>
<sZ>1 aut.</sZ>
</inist:fA14>
<country>Suisse</country>
<wicri:noRegion>Dalle molle Inst.Perceptual Artif.</wicri:noRegion>
</affiliation>
</author>
<author><name sortKey="Odobez, J M" sort="Odobez, J M" uniqKey="Odobez J" first="J. M." last="Odobez">J. M. Odobez</name>
</author>
<author><name sortKey="Bourlard, H" sort="Bourlard, H" uniqKey="Bourlard H" first="H." last="Bourlard">H. Bourlard</name>
</author>
</analytic>
<series><title level="j" type="main">Pattern Recognition</title>
<title level="j" type="abbreviated">Pattern Recogn.</title>
<idno type="ISSN">0031-3203</idno>
<imprint><date when="2004">2004</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
<seriesStmt><title level="j" type="main">Pattern Recognition</title>
<title level="j" type="abbreviated">Pattern Recogn.</title>
<idno type="ISSN">0031-3203</idno>
</seriesStmt>
</fileDesc>
<profileDesc><textClass><keywords scheme="KwdEn" xml:lang="en"><term>Algorithms</term>
<term>Broadcast documents</term>
<term>Experiments</term>
<term>Feature extraction</term>
<term>Image analysis</term>
<term>Image coding</term>
<term>Machine learning text verification</term>
<term>Text processing</term>
<term>Theory</term>
</keywords>
<keywords scheme="Pascal" xml:lang="fr"><term>Théorie</term>
<term>Analyse image</term>
<term>Codage image</term>
<term>Traitement texte</term>
<term>Algorithme</term>
<term>Extraction caractéristique</term>
<term>Expérience</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en">This paper presents a new method for detecting and recognizing text  in complex images and video frames. Text detection is performed in a two-step approach that combines the speed of a text localization step, enabling text size normalization, with the strength of a machine learning text verification step applied on background independent features. Text recognition, applied on the detected text lines, is addressed by a text segmentation step followed by an traditional OCR algorithm within a multi-hypotheses framework relying on multiple segments, language modeling and OCR statistics. Experiments conducted on large databases of real broadcast documents demonstrate the validity of our approach. © 2003 Pattern Recognition Society. Published by Elsevier Ltd. All rights reserved.</div>
</front>
</TEI>
<affiliations><list><country><li>Suisse</li>
</country>
</list>
<tree><noCountry><name sortKey="Bourlard, H" sort="Bourlard, H" uniqKey="Bourlard H" first="H." last="Bourlard">H. Bourlard</name>
<name sortKey="Odobez, J M" sort="Odobez, J M" uniqKey="Odobez J" first="J. M." last="Odobez">J. M. Odobez</name>
</noCountry>
<country name="Suisse"><noRegion><name sortKey="Chen, D" sort="Chen, D" uniqKey="Chen D" first="D." last="Chen">D. Chen</name>
</noRegion>
</country>
</tree>
</affiliations>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Exploration

HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 001612 | SxmlIndent | more

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 001612 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     Pascal:04-0085114
   |texte=   Text detection and recognition in images and video frames
}}

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024

	Serveur d'exploration sur l'OCR
	Attention, ce site est en cours de développement ! Attention, site généré par des moyens informatiques à partir de corpus bruts. Les informations ne sont donc pas validées.

Serveur d'exploration sur l'OCR

Text detection and recognition in images and video frames

Text detection and recognition in images and video frames

Source :

Descripteurs français

English descriptors

Abstract

Links toward previous steps (curation, corpus...)

Le document en format XML

Pour manipuler ce document sous Unix (Dilib)

Pour mettre un lien sur cette page dans le réseau Wicri